Using semi-structured data for assessing research paper similarity
نویسندگان
چکیده
Article history: Received 8 December 2011 Received in revised form 24 May 2012 Accepted 26 September 2012 Available online 6 October 2012
منابع مشابه
Trusting Semi-structured Web Data
The growth of the Web brings an uncountable amount of useful information to everybody who can access it. These data are often crowdsourced or provided by heterogenous or unknown sources, therefore they might be maliciously manipulated or unreliable. Moreover, because of their amount it is often impossible to extensively check them, and this gives rise to massive and ever growing trust issues. T...
متن کاملFingerprinting of some Egyptian rice genotypes using Intron-exon Splice Junctions (ISJ) markers
DNA fingerprinting has become an important tool for diversity assessment and varietal identification in plant breeding programs. Semi- random PCR primers targeting intron-exon splice junctions (ISJ) were used to evaluate the potential of these markers in identification and classification of rice genotypes. A total of 12 ISJ primers were used for screening fourteen Egyptian rice genotypes, inclu...
متن کاملVery Fast Similarity Queries on Semi-Structured Data from the Web
In this paper, we propose a single low-dimensional representation for entities found in different datasets on the web. Our proposed PIC-D embeddings can represent large D-partite graphs using small number of dimensions enabling fast similarity queries. Our experiments show that this representation can be constructed in small amount of time (linear in number of dimensions). We demonstrate how it...
متن کاملSimilarity and Analogy over Application Domains
Databases, particularly when storing heterogeneous, sparse semistructured data, tend to provide incomplete information and information which is difficult to categorize. This paper first considers how to classify entity instances as members of entity classes organized in a lattice-like generalization/specialization hierarchy. Then, it describes how the frame representation employed for instances...
متن کاملA Novel Method for Finding Similarities between Unordered Trees Using Matrix Data Model
Trees are capable of portraying the semi-structured data which is common in web domain. Finding similarities between trees is mandatory for several applications that deal with semi-structured data. Existing similarity methods examine a pair of trees by comparing through nodes and paths of two trees, and find the similarity between them. However, these methods provide unfavorable results for uno...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Inf. Sci.
دوره 221 شماره
صفحات -
تاریخ انتشار 2013